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SUMMARY 



The pirimary objective of the proposed research was to investigate 
the effects on learning of various stimulus-response (S-R) mapping 
schemes. It was hypothesized that S-R connections are not simply arbi- 
trary that the way stimulus and response classes are mapped together 
may have a profound effect on learning rate. To study this problem, two 
paradigms based on discrimination learning procedures were constructed. 

The first paradigm made possible qualitative variations in mapping 
the study of one-one, many-one, and one-many S-R connections 
while holding constant across conditions the number of required associa- 
tions. Selected possible outcomes were theoretically related to type of 
mapping variation and the development of cognitive processes with age. 
The results obtained were puzzling. One order of difficulty was found 
for age 12 and another for age 14. The theorizing engaged in was not 
capable of handling such a result. 

rGsults of the second study are more intelligible. The para- 
digm used in this study made possible the quantitative variation of S-R 
mapping on an order- disorder dimension. (In less technical language, 
this is equivalent to numerically varying the number of exceptions to a 
classification rule.) The results indicated that one degree of mapping 
disorder (one exception) produced about the same number of errors as a 
completely Irregular mapping. But the difference between one degree of 
ma pping disorder and zero degrees of mapping disorder (a classification 
rule with no exceptions) was extremely large: the task was trivial when 

there were no exceptions and it became quite difficult with only one 
exception. A theory based on hypothesis sampling and generation was put 
forth to explain the observed results. 



INTRODUCTION 



The S-R Mapping Problem 

Stimulus-response (S-R) connections, or mappings^ bulk large in 
contemporary psychology, both experimentally and theoretically, but sur- 
prisingly little is known about the relation between various mappings 
and learning efficiency. A few examples will help make clear the nature 
and depth of the problem. 

Fries (1963) and Gibson (1963) have pointed out that one of the 
difficulties inherent in learning to read the English language resides 
in the irregular mapping relations that occur between the printed alpha- 
bet letters (stimuli) and the phonemes of spoken utterances (responses). 
The printed words "red", "read ”, "read ”, and "reed” Illustrate this 
nicely. "Red” is a printed symbol for a color, and "readj^” is a printed 
symbol for what has been done with a book. They are different stimuli 
but they have the same spoken response. In contrast "read 2 " is a symbol 
for what will be done with a book, and therefore "read^” and "read^ are 
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Identical stimuli, but they call for different responses. To further 
complicate matters, "reed" combines with "read2" to give another 
instance of different stimuli having the same response. In practice of 
course, experienced readers suffer little from the effects of such 
mapping irregularity because they are sensitive to the context. But 
beginning readers are not likely to be so fortunate. It is for this 
reason that phonemic alphabets, like augmented Roman (Downing, 1965), 
are of great potential importance: they allow the beginning reader to 

work with a one-one regular mapping between written and spoken forms 
before transferlng him to the irregular mappings that exist between the 
\jr£tten and spoken forms of a natural language like English. 

Another example concerns the problem of machine translation of 
language (say, Russian to English) and the problem of second language 
learning. Lamb (1965) points out that there exists no simple one-one 
mapping between words in one language and words In another. If there 
were such a mapping, the problem of machine translation would be rldl- 
culously easy. All that would be needed is a computer with fairly .. 
large memory storage and a concordance between the "source" language 
(e.g. Russian) and the "target" language (e.g. English). Also in this 
particular case since the mapping is one-one the procedure is completely 
reversible. As Lamb claims, there Is obviously no one-one mapping be- 
tween languages at the level of words , or perhaps at any other level. 

For a machine translation to be adequate, it must take into account 
language structure as well as individual units, and any talk of struc- 
ture would have to deal with at least constituent analysis (Gleason, 
1961) and transformations (Chomsky, 1957). At these more abstract 
levels of analysis it is still not too likely that very many one-one 
mappings can be found between a source language and a target language. 

In any case one-o»'e mappings are not necessary: many-one mappings will 

be perfectly adequate. The problems arise when one-many relations are 
found at some level of analysis; additional machinery is required to 
sort out which of the "many" is most appropriate. 

Now it is possible to consider second language learning, at least 
in its initial stages, as human translation between one language and 
mother. If this argument is accepted, It Is at once clear that one- 
oue, ma ny-one, and one-many mappings and their associated problems 
apply to second language learning just as surely as they do to machine 

translation. 

A final detailed example of an experimental nature will suffice to 
show the magnitude of effects attributable to mapping differences. 
Goldstein and Weber (1965) used the old Lashley (1938) conditional dis- 
crimination paradigm as a point of departure. There were four displays 
x/ab, x/ba, y/ab, y/ba. Each display had three highly distinctive non- 
sense forms on it. The two elements 'a' and 'b' were arrayed horizon- 
tally on the display and were offered as alternatives for choice. The 
third element, either 'x' or 'y', appeared at the top of the display 
and was a symbol or cue determining the correct choice. In other words 
the correct response was made conditional on the particular cue item 
present in the display. For one group whenever cue item 'x' was pre- 
sent ''a' was the correct or reinforced choice, irrespective of the 
position of 'a', left or right, in the display. Whenever cue item y 



was present 'b' was the correct cl’olce, again irrespective of the posi» 
tlon of 'b'. Hence in this group a particular cue form was made to 
correspond to a particular response form. But in another group, posi- 
tion of the response rather than its form was the relevant feature. 

Thus in the presence of cue 'x' the left hand response element (irre^- 
spective of whether it was 'a' or *b') was the correct choicej and in 
the presence of cue 'y' the right hand element (again irrespective of 
its form) was correct. Striking differences in learning occurred under 
these two schemes of cue-nesponsc mappings When the correct response 
was based on mapping form-to-form thf* task was trivially easy for ^s 
(everyone of the high school ^s learned, ano mean errors to a 20-error- 
less-trial criterion was 3.7 T. But when the correct choice was based 
on a form-to-position mapping, the task became quite difficult (3 out 
of 10 Ss failed to learn in 1280 trials; and mean errors to criterion 
for the 7 learners was 45.1). Now both groups of ^s saw exactly the 
same displays in exactly the same order; the sole difference in treat- 
ment for the two groups resided in the way cue-response mappings were 
set up. In the easy case the mapping was based on form-form pairing 
and in the difficult case on a form-position pairing. 

For various reasons involving the impossibility of strictly ortho- 
gonal comparisons and the impossibility of numerical variation in degree 
of S-R mapping, the Goldstein and Weber experiment and procedure is not 
ideal for the systematic study of S-R mapping. (And such was not its 
intention). Nonetheless it indicates that the way correct responses 
and their corresponding stimuli "dovetail" or "map" together can have 
a potent effect on ease of learning. 

There are, of course, other relevant references, and they would 
include the following: Fitts and Seeger (1953), Shaffer (1965), Weber 

(1965), and Weber ard Woodward (1966). 

The following two experimental paradigms serve to make more precise 
the notion of S-R mapping and supply needed detail on the research con- 
ducted. 

EXPERIMENT I 

One-one. Many-one, and One-many S-R Mappings 

A study was made of effects on acquisition due to various S-R 
mappings. A one- one mapping means that for each stimulus or cue item 
there is one, and only one, corresponding correct response item; con- 
versely, each correct response is conditional on one, and only one, cue 
Item. An example of this is the standard paired associate task in 
which number of cue itiems and number of response items are equinumerous. 
A many-one mapping pieans that several different cue items correspond to 
a single correct response. 4n example of this is afforded by the paired 
associate work of Bower (1961) in which there was a total of ten differ- 
ent stimulus items corresponding to only two different response Items. 

A one-many mapping means that each stimulus item corresponds to several 
porrect response items. Examples of this idapping in the paired asso- 
ciate literature are difficult to find. About the closest approximation 
is Osgood's (1949) transfer paradigm in which ^s first learn and 
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then S--R«. But this transfer task Is probably not directly relevant 
here because the present task does not Involve transfer! and by using 
discrimination learning techniques Ss need only recognize rather than 
recall a correct choice. The Recognition task means th&t ^ can specify 
on each trial the response options available to It Is thus possible 
to have a pure ons-many task when discrimination procedures are used. 

This Is In contrast to the paired associate procedure In which 2 or 
more recall responses correspond to a single cue item: In this case 

can thwart our experimental manipulations by using only one of the 
responses and thereby collapsing the task to a simple one-one mapping. 
Discrimination procedures also have the advantage of enabling one to 
hold constant the minimum number of required associations among the 
various types of S-R mapping. This will be made tnore clear In the Method 
section. 



Method 



Subjects . Two age groups, 12 year olds and 14 year olds, were 
employed. They were randomly assigned to conditions, 10 per group, for 
6 groups giving a total of N«72. 

Design and Procedure . Table 1, Illustrates the experimental de- 
sign. Letters of the alphabet serve to represent distinctive visual 
nonsense forms (Goldstein and Weber, 1965). Each display contains one 
cue Item (w,x,y, or z) at the top of the displays and two response Items 
(a,b*, and/or c,d — depending on the group) at the bottom of the display 
(Goldstein and Weber, 1965). For group 1-1 there Is a one-one mapping 
between cue Items and correct response Items (correct choices are repre- 
sented by underlining In Table 1). In other words each cue Item has 
associated with It one and only one correct choice, and each correct 
response has associated with it one and only one cue Item. For group 
M-1 there Is a many-one mapping between cue Items and correct response 
Itemsj several (two^) cue Items are associated with each response Item. 
Finally for group 1-M there Is a one-many nwipplng between cue and res- 
ponse items; each cue item Is associated with several (two)^ correct 
response items. It is Important to note that the number vfour) of dis- 
tinct associations between cue and response Items Is constant across 
the various mapping conditions; hence number of associations Is not 
confounded with type of mapping. However, there Is a partial confound- 
ing between mapping type and number of distinct visual forms as denoted 
by letters of the alphabet. This number Is eight for group 1-1 and six 
each for groups M-1 and 1-M, and It Is reflected by the different un- 
certainty values at the bottom of Table 1. 

The eight displays of each mapping condition appearc'd In successive 
randomized blocks. The displays were on 4- X 6-ln. cards. ^ responded 
by pointing to one or the other of the two choice Items. JE provided 
reinforcement by saying "Right" or "Wrong". A noncorrection procedure 
was employed. Data consisted of error scores. 
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TABLE 1 



Display Content and Reinforcement Assignment 



a 



Mapping Scheme 



1-1 



M - 1 



1 - M 



w/j^ b 
w/b a 
x/a b 
x/b a 
y/£ d 
y/d c 
z/c ^ 
z/d c 



w/a b 
w/b, a 
x/a b 
x/b a 
y/a b 
y/b a 
z/a b 
z/b a 



w/£ b 
w/b £ 
x/a b 
x/b a 
w/£ d 
w/d £ 
x/c d 
x/d c 





Cue Items 


2 


2 1 


Uncertainty 








In Bits 


Response 




1 2 




Items 


2 


1 ^ 




Cue and 

Response Ijtems 


3 


2.58 2.58 



®Note: Underlining Is employed to schematically Indicate correct 



choices. 



RESULTS 

The major results are presented In the ‘ 

error score per S as a point of departure we note thst for *8® 

Group Many-one Is least difficult (32.0 errors) and Group 
Lst dWficult (53.2 errors). This result corresponds to "ha* 
termed the -transfer surface hypothesis- In the ^^scus^ 

The result makes good sense until the findings tor Age 

For Age 14 all groups are essentially equivalent In Mean errors, 

ranging from 34.3 to 39.0. This result corresponds to what he 

termed the "equality hypothesis" and also -“hes sense theoretically. 

There seems to be an Interaction between age and kind of stimulus 
There seems co p mapping Is an effective variable at age 

response mapping. The type or mapping ^ to under- 
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TABLE 2 





Descriptive 


Error Statistics as a 
of Age and Mapping 
(N * 10 per Group) 


Function 




Mapping 


Age 




Mean 


SD 


Mdn 


Many- one 


12 




32.0 


21.42 


21.0 




14, 




36.7 


17.54 


38.5 


One- one 


12 




42.3 


17.61 


44.5 




14 




34.3 


16.42 


33.0 


One-Many 


12 




53.2 


9.83 


54.0 




14 




39.0 


18.04 


36.0 



Perhaps the most difficult part of making any meaningful Interpre- 
tations arises from the large variability encountered. TI\1 q reflects 
the fact that a number of Ss did not perform at better than a chance 
level. The standard deviations are correspondingly large. 

In terms of statistical effects the large variabilities again com- 
plicated matters. An analysis of variance was conducted and neither 
the main effects (age and type of mapping) nor the interaction produced 
an F ratio approaching significance at the .05 level. 



Discussion 



If we use the symbol "x<y" to indicate that group x is less diffl 
cult than group y, then at least the following outcomes would possess 
psychological meaning in the sense that arguments could be adduced for 
them. 



(1) 1-1 < 1-M, M-1. This would be the outcome expected on the 
basis of the rationale behind the augmented Roman alphabet (Downing, 
1965), i.e., learning ought to be better when the S-R mapping Is 1-1, 
as in phoneme- letter matching. We might term this the augmented Roman 
hypothesis. 

(2) 1-M < M-1 < 1-1. An outcome of this form might be expected 
if Ss were "response bound", that is, Ss are morr likely to observe a 
display item if they are required to respond to It (Lashley, 1938). 

If this Is true, then Ss ought to learn 1-M easier than M-1 because the 
former has the most uncertainty associated with response items and the 
latter with cue items. Finally group 1-1 ought to be most difficult 
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because its response uncertainty Is greater than or equal to that of the 
othev groups, and the uncertainty associated with cue items is also 
greater than or equal to that of the other groups. What would be parti- 
cularly interesting here is If there were an interaction with type of 
mapping and the age variable. Younger ^s might be more "response bound" 
than older ^s. Weber and Woodward (1966) have, for example, found that 
college students are quite proficient in observational learning of com- 
plex discrimination problems. In short, younger ^s might show the order 
of difficulty 1-M < M-1 < 1-1, while older ^s wouTd display some other 
order of difficulty, because they extract information from displays 
irrespective of whether they respond to a particular item or not. Hy- 
pothesis (2) will be termed the "response bound" hypothesis. 

(3) M-1 < 1-1 < 1-M. An outcome of this form would be expected 

if Osgood's (19‘V9) transfer surface were somehow applicable. As pointed 

out In the introductory section of Experiment I, the applicability of 

the transfer surface Is not at all clear cut. If it were, then we would 

expect group 1-M to be most difficult because, learning a list 

and then transfering to a list S. - R« often produces negative transfer. 

Similarly, group M-1 ought to be easiest because - Rj^ followed by 

S- - R. often produces positive transfer, and group 1-1 ought to be 

intermediate because the learning of - R, followed by 

be the control condition for assessing whetner positive or negative 

transfer occurred in the other groups. This is the transfer surface 

hypothesis. 

(4) 1-1 ■ M-1 * 1-M. It is entirely possible that all groups 

would be of equal difficulty. This might be because the mapping varia- 
ble as proposed here might be too easy and lead to a ceiling effect in 
which all ^s readily learn. It is for this reason and others, that two 
different age groups were used. The ceiling effect might very well 
occur in the older age group but not in the younger: there might be an 

interaction effect between the two main effects, type of mapping and 
age of _Ss. This is the equality hypothesis. 

The r isults suggest that the transfer surface hypothesis would 
somehow be applicable for age 12, while the equality hypothesis would 
be most applicable at age 14. But in the absence of statistically 
significant effects these conclusions must be very tentative. 

EXPERIMENT II 

Numerically Varied S-R Mapping Disorder 

Only a brief account of this study is given since it proved publish- 
able and reprints are attached. 

Mapping disorder (exceptions to a classification rule) was numeri- 
cally varied. Zero degrees of mapping disorder (a classification rule 
with no exceptions) proved trivially easy. One degree of mapping dJ s- 
order produced a dramatic increase in mean errors. Degree of mapping 
disorder was related a priori to a linear variable (number of different 
correct choices) and to a quadratic variable (conditional mapping un- 
certainty U (R)). Mean errors were significantly related to only the 



7 



quadratic component. Other results Included: a closet relation between 

U (R) and standard deviations than between U (R) and means, within 
gllups Unique patterns of errors related to Sapping 
corrLpondence between post-experimental subjective , 

task structure and difficulty. Finally, 

of mapping effects were considered. An explanation phrased in terms 
of mapplij uncertainty and hypothesis storage, sampling, and generation 
gave the best account of the obtained results. 



CONCLUSIONS AND RECOMMENDATIONS 

It seems clear that mappings between stimulus and response classes 

are extremely Important for such practical concerns as «ad^ ttudles 
tlon second language learning, and language translation. The studies 
in tWs contract were designed to get at potentially important mapping 
variables at a theoretical level. 

It is apparent that the results of the first study ("0“®-°“®' 
one, and One-many S-R Mappings") are ambiguous: no significant dlffe - 

ences occurred with respect to either task or age level. This is no 
doubt due to the difficulty of the task because a number of ®“^J®®*® ^ 
each condition failed to learn. With ®‘‘-""J“®. 

meaningful (words, high-meaning nonsense syllables, etc.) more 8“®)®®^ 
would learn, and the results might fit mucn better with one of 
theoretical outcomes discussed. 

Experiment II ("Numerically Varied S-R Mapping Disorder") was an 
unqualified success. It shows clearly that even one exception to a 
clLsificatlon rule is enough to severely disrupt learning, ais sug- 
eests further experimentation on how exceptional material ought to b 
progra^rd. For example. In the case of spelling rules 
Lould the rule be mastered before the exceptions are 
would happen to rule learning if exceptions were introduced 
rule instances at the beginning of learning? In short, are spel^ng 
rules learned better in the absence of exceptions, and, if so, how 

much better. 
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